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Abstract 

The 3 -dimensional crystal structure of glutathione S-transf erase (GST) of Schistosoma japonicum (Sj) fused with 
a conserved neutralizing epitope on gp41 (glycoprotein, 41 kDa) of human immunodeficiency virus type 1 (HIV-1) 
(Muster T et al., 1993, J Virol 67:6642-6647) was determined at 2.5 A resolution. The structure of the 3-3 iso- 
zyme rat GST of the /i gene class (Ji X, Zhang P, Armstrong RN, Gilliland GL, 1992, Biochemistry 31: 10169- 
10184) was used as a molecular replacement model. The structure consists of a 4-stranded /3-sheet and 3 a-helices 
in domain 1 and 5 o>helices in domain 2. The space group of the Sj GST crystal is P4 3 2!2, with unit cell dimen- 
sions of a = b = 94.7 A, and c — 58. 1 A. The crystal has 1 GST monomer per asymmetric unit, and 2 monomers 
that form an active dimer are related by crystallographic 2-fold symmetry. In the binding site, the ordered struc- 
ture of reduced glutathione is observed. The gp41 peptide (Glu-Leu-Asp-Lys-Trp-Ala) fused to the C-terminus 
of Sj GST forms a loop stabilized by symmetry-related GSTs. The Sj GST structure is compared with previously 
determined GST structures of mammalian gene classes p, a, and 7r. Conserved amino acid residues among the 
4 GSTs that are important for hydrophobic and hydrophilic interactions for dimer association and glutathione 
binding are discussed. 

Keywords: fusion protein; glutathione S-transferase; human immunodeficiency virus type I; Schistosoma japon- 
icum; 3-dimensional structure 



Glutathione 5-transferases (GSTs) are a family of multifunctional 
enzymes that catalyze the nucleophilic addition of the reduced 
thiol of glutathione to a variety of electrophiles (Mannervik & 
Danielson, 1988; Armstrong, 1991; Gilliland, 1993; Rushmore 
& Pickett, 1993). They are important enzymes involved in the 
metabolism of potentially toxic alkylating agents. Cytosolic GSTs, 
dimeric proteins with a molecular mass of each subunit of about 
26 kDa, are also involved in intracellular binding and transport 
of hydrophobic ligands and participate in the synthesis of pros- 
taglandins and leukotrienes. Specific molecular forms of GST 
are known to be expressed in the cellular mechanisms of drug 
resistance (Tsuchida & Sato, 1992). 
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GSTs from a variety of organisms have been identified and 
characterized (Hiratsuka et al., 1990; Dominey et al., 1991; 
Meyer et al., 1991 ; Mannervik et al., 1992; Trottein et al., 1992; 
Mignogna et al., 1993), including the detection in a wide range 
of parasitic helminths (Brophy & Barrett, 1990; Brophy et al., 
1990). Cytosolic GSTs of the genus Schistosoma participate in 
the immunogenicity to the vertebrate host and have been sug- 
gested as potential components of a vaccine against schistoso- 
miasis (Capron et al., 1987; Sher et al., 1989). In addition, the 
development of a gene expression system employing the GST 
from Schistosoma japonicum (Sj) allows the fusion of a pro- 
tein or protein fragment to the C-terminus of the enzyme (Smith 
& Johnson, 1988; Fainsod et al., 1991; Mendndez- Arias et al., 
1992; Oshima-Hirayama et al., 1993; Slany et al., 1993). The fu- 
sion peptide or protein is coupled to GST through a linker that 
can be cleaved by a specific protease, thus providing a useful sys- 
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Trp-Ala from gp41 (glycoprotein, 41 kDa) of human immuno- 
deficiency virus type 1 (HIV-1). Muster and coworkers (1993) 
determined that this peptide fused with GST was bound to the 
human monoclonal antibody (Mab) 2F5. This sequence of gp41 
was found to be highly conserved in a variety of HIV-1 isolates, 
suggesting its potential crossreactivity to the antibodies with 
properties similar to Mab 2F5. The original goal of this work 
was to crystallize the complex of Sj GST fusion protein with 
Mab 2F5, but only a crystal of the GST fusion protein was ob- 
tained. Nevertheless, because a peptide or protein fragment by 
itself is often difficult to crystallize, the GST fusion protein rep- 
resents a great potential for the structure determination of 
smaller protein fragments. The significance of this fusion 
method is discussed further elsewhere (Carter et al., 1994). 



Fig. I* Sj GST with fused peptide in a ribbon diagram. This figure was 
produced with the MOLSCRIPT program (Kraulis, 1991). 



tern for cloning, expressing, and purifying the desired protein or 
peptide product by glutathione-based affinity chromatography. 

A number of crystal structures of mammalian GSTs com- 
plexed with a variety of ligands in the active site have been re- 
cently determined (Reinemer et al., 1991, 1992; Ji et al., 1992; 
Sinning et al., 1993; Garcia-Saez et al., 1994). The structures 
have revealed a common folding pattern with a number of vari- 
ations in the binding mode of glutathione and especially the xe- 
nobiotic substrate binding site. Crystallographic studies have 
also been instrumental in determining the structural features of 
the active site essential for catalysis and substrate specificity (Liu 
et al., 1992, 1993; Manoharan et al., 1992; Ji et al., 1993, 1994). 

In this article, the 3-dimensional crystal structure of Sj GST 
with a C-terminal fusion peptide and reduced glutathione in the 
active site is described and compared to the 3 previously deter- 
mined GST structures from the mammalian gene classes, a from 
human (Sinning et al., 1993), /i from rat (Ji et al., 1992), and 
7T from pig (Reinemer et al., 1991). Because Sj GST shares a 
striking sequence homology with the mammalian ^ GST (Smith 
etal., 1986; Hughes, 1993), this structural comparison reveals 
the shared characteristics of the amino acid residues in the glu- 
tathione binding site of both the vertebrate and invertebrate 
GSTs. The sequence of the fusion peptide is Glu-Leu-Asp-Lys- 



Results and discussion 

Of the 4 GST structures compared, only Sj GST has a mono- 
mer in its crystal asymmetric unit. For the others, the 2 mono- 
mers in a dimer-paired asymmetric unit are called subunits A 
and B. Following this practice, the Sj GST monomer in the 
asymmetric unit is also designated subunit A, and its symmetry- 
related counterpart subunit B. Unless specified otherwise, all 
structural comparisons were made with subunit A of each GST. 
The atomic coordinates of ^t, a, and tr GSTs were obtained from 
Brookhaven Protein Data Bank (identification codes 1GST, 
1GUH, and 1GSR, respectively). Furthermore, the amino acid 
residue names and numbers used in this article are for Sj GST; 
for the others, residue names and numbers are used in conjunc- 
tion with a specific GST. The coordinates of the Sj GST struc- 
ture are deposited in the Protein Data Bank (identification code 
1GNE). 

Final model of the structure 

The crystal structure of Sj GST with the gp41 fusion peptide at 
2.5 A resolution is shown in Figure 1 and Kinemage 1 . It is a 
monomer forming a perfect dimer by crystallographic 2-fold ro- 
tation (Fig. 2). It shares similar characteristics with other GSTs: 
domain 1 (residues 1-84) is a typical GST folding pattern of 
$ct$a$$, and domain 2 (residues 85-217) consists of 5 a-helices 
followed by a long stretch of loops (Fig. 3). The summary of 
least-squares refinement parameters is shown in Table I. Also, 
the 4>-\p torsion angle plot for main-chain atoms is shown in Fig- 




Fig. 2. Stereo view of a C« tracing of Sj 
GST with the fusion peptide in dimer 
form. Thick and thin lines distinguish the 
2 monomers. 



Schistosoma japonicum glutathione S- transferase fusion protein 
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Fig. 3. Amino acid sequences of GSTs. The alignments are based on structural considerations. Secondary structural elements 
are marked by enclosed lines, and common residues among all GSTs are shaded. The Sj GST sequence continues with those 
of linker peptide and the gp41 fusion peptide. 



Table 1. Summary of least-squares refinement parameters 
of Sj GST fusion protein complexed with glutathione 





Target 
value 


Final 
model 


Reflections from 6.0 to 2.3 A with /> la(/) 




8,917 


Reflections from 6.0 to 2.5 A with /> la(/) 




7,622 


Crystallographic /? a factor (6.0-2.3 A) 




0.227 


Crystallographic R factor (6.0-2.5 A) 




0.219 


RMS deviation from ideal distances (A) 






Bond distances 


0.030 


0.020 


Angle distances 


0.036 


0.038 


Planar 1-4 distances 


0.030 


0.023 


RMS deviation of bond angle (°) 




1.2 


RMS deviation from planarity (A) 


0.040 


0.034 


RMS deviation from ideal chirality (A 3 ) 


0.200 


0.269 


Thermal parameter correlations (mean/A0) 






Main-chain bend 


1.500 


1.300 


Main-chain angle 


2.000 


1.950 


Side-chain bend 


2.000 


1.269 


Side-chain angle 


1.500 


1.600 



■^-SwI/y-IFcl/EwflFJ. 



ure 4. The non-glycine residues with near-unfavorable torsion 
angles are Gin 66, which is involved in glutathione binding, 
Asn 143, which is in the loop between the a-helices 5 and 6, and 
Glu 227 in the fusion peptide. There are 2 cw-prolines at resi- 
dues 55 and 201. Furthermore, a highly ordered glutathione 
structure was determined in the binding site (Fig. 5). The asym- 
metric unit has 126 water molecules, 30 of which do not have 
direct interactions with the protein itself but with other water 
molecules. Most of the water molecules are found in the open 
region at the dimer interface, around the glutathione binding 
site, and around the /3-strands in domain 1 . 

Omit maps showed that the residues of the main Sj GST struc- 
ture are well defined. These residues have low temperature fac- 
tors (4-15 A 2 ) (Fig. 6). Temperature factors for the fusion 
peptide are relatively higher (15-30 A 2 ). Electron density for 
the linking amino acids Leu 220 and Val 221 is not clearly de- 
fined; these are the residues with hydrophobic side chains that 
are in the bulk solvent region. The electron density map indi- 
cates the possibility of alternative conformations for these 
residues. 

The gp41 fusion peptide is extended by the linker peptide (Ser- 
Asp-Leu-Val-Pro-Arg-Gly-Ser-Met; Fig. 3; see Kinemage 1) to 
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Fig. 4. ^-^ Angle plot of Sj GST with the fusion peptide. Non-glycines 
are plotted as open squares and glycines as crosses. 

interact with the symmetry-related GSTs (Fig. 7) and consists 
of a series of tight turns (Fig. 8). The loop formed by the resi- 
dues 224-229 is stabilized by the interactions between Ser 225 
and Asp 229 and a water molecule forming hydrogen bonds with 
the carbonyl O atoms of Met 226 and Glu 227 and the side chain 
of Asp 229. In addition, the side chain of Met 226 is positioned 



between those of Leu 162 and Tyr 163 of the symmetry-related 
Sj GST. The relative increase in temperature factors for the 
C-terminal residues and fusion peptides reflects the higher mo- 
bility of this portion of the structure (Fig. 6). The same trend 
was also found in the refined structure of /x GST. 

Comparison of amino acid sequences 
and tertiary structures 

Sequence alignment of Sj GST and rat n GST resulted in 92 res- 
idue pair identities, which is 92/217 = 42% (Fig. 3). Also, there 
are amino acid pairings that may be considered homologous: 
(1) Leu and He in 7 cases, (2) Glu and Asp in 5 cases, and 
(3) Lys, Arg, and His in 6 cases. Thus, the effective sequence 
homology is (92 + 18)/217 = 51%. This percentage could be 
greater with more generous criteria, such as residues with simi- 
lar polar side chains. The corresponding values for the compar- 
ison between Sj GST and a GST are 24% and 33%, and between 
Sj GST and tt GST are 27% and 32%. 

The sequence alignment between Sj GST and ^ GST shown 
in Figure 3 also indicates the switching of side-chain polarities: 
(1) positive and negative side chains in 5 cases, (2) positive and 
polar residues in 1 1 cases, and (3) negative and polar residues 
in 6 cases. These changes are accompanied by compensating in- 
teractions. For example, at the interface of the dimer pair, the 
side chain of Asp 76 from subunit A interacts with the side chain 
of Arg 88 of subunit B, whereas in p GST, the amino acid resi- 
dues are changed to Arg 81 and Asp 97, respectively. 

All 4 GSTs share similar structural characteristics: 0c*0a/3/3a 
folding pattern in domain 1, and primarily a-helices in domain 
2. To compare the tertiary structures, positional and rotational 
differences between domain 1 and domain 2 were made by su- 






Ftg. 5. The structure of glutathione. A: Stereo 
view of F Q - F c electron density map of gluta- 
thione in the Sj GST binding site contoured at 
the 3a level. B: Atom assignments. 
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Fig. 6. Plot of temperature factors of main-chain atoms N, Cct, and 
C of Sj GST with the fusion peptide. Bars at the bottom of the graph 
represent the secondary structural elements: thick bars for 0-strands and 
thin bars for cr-helices. 



perposition of selected Ca atoms (Fig. 9). The 4 /3-strands and 
2 accompanying a-helices 1 and 3 in domain I are the most con- 
served structural features among the 4 GSTs compared here and 
hence used as the reference structure. The RMS deviation after 
the first superposition of domain I of m» ir GSTs to that of 
Sj GST were 0.57 A, 0.95 A, and 0.92 A, respectively. The sec- 
ond superposition was performed to determine the RMS devi- 
ations, rotation angles, and magnitude of translation required 
to superimpose Ca atoms of domain 2 (Table 2), which indicate 
how the tertiary arrangement of domain 2 differs from one GST 
to another when domain 1 is used as a reference. It is apparent 
that, although between Sj, and tt GSTs the magnitude of 
translation required to move one domain 2 to another for the 
best fit is about 1 A, that performed between a GST and the 
other 3 GSTs is more than 3 A. This large difference in trans- 
lation is mostly due to the fact that in a GST, the a-helix 6 (and 
also a-helices 7 and 8 that follow it) is separated to a much 
greater extent from the a-helix 4, nearest and parallel to the 
2-fold rotational axis for a GST dimer, than in the other GSTs. 
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The difference in Ca atom distances between the a-helices 4 and 
6 is about 1 .2 A between a GST and the other GSTs. The rota- 
tion angle is also larger by about 2° when a GST is compared 
with the other 3 GSTs. Furthermore, another source of the struc- 
tural difference is that the a-helix 4 of Sj GST is curved by about 
15°, whereas that of /x GST is almost linear. 



Hydrophobic and hydrophilic interactions 

A common hydrophobic interface between the dimer pair can be 
observed in all 4 GSTs. The residue in the loop after the a-helix 
2 is the conserved Phe 5 J and nearby are the residues Leu 64 and 
Ala 69 that are at the end of the ^-strand 4 (Fig. 3). These resi- 
dues face the residues in subunit B of the dimer pair, Ala 89, 
Met 93, Leu 94, and Phe 132. The sequence number and iden- 
tity of residues that participate in this hydrophobic interaction 
differ slightly in the other GSTs, but the above-mentioned res- 
idues make up the core structure for the hydrophobic interaction 
between the GST dimer pairs. The region along the dimer 2-fold 
axis is a relatively open hydrophilic channel, and flanking this 
channel on both sides of the dimer is this hydrophobic interface 
acting as a stabilizing force for the GST dimer. 

The hydrophilic interactions between the dimer pair is more 
variable than the hydrophobic interactions. Still, certain key se- 
quence positions can be distinguished: Gin 66, Arg 72, Asp 76, 
Arg 88, Ser 92, and Asp 100. These residues are located in the 
a-helices 3 and 4, and together with the corresponding a-helices 
of subunit B, they are positioned nearly parallel along the 2-fold 
axis and form a hydrophilic core. The residue Gin 66 participates 
in the glutathione binding and interacts with Asp 100 of subunit 
B. The residue Arg 72 interacts with its own symmetry-related 
residue in subunit B. The residue Asp 76 interacts with Arg 88 
of subunit B. In p GST, the polarity is changed so that Arg 81 
interacts with Glu 90 and Asp 97 of subunit B. In tt GST, Arg 77 
interacts with Asp 88 of subunit B. The residue Ser 92 shows no 
direct interaction with subunit B in Sj GST; however, in the 



Fig. 7. Stereo view of a Ca tracing of 
Sj GST with its fusion peptide interact- * 
ing with symmetry-related GSTs. 
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Fig. 8. Stereo view of F { , — F c electron 
density map of the gp4l fusion peptide 
contoured at the 2.5a level. 



other 3 GSTs, this sequence position is aspartate, which shows 
interactions with a positively charged residue in subunit B. 

There are other hydrophilic residues that are conserved in all 
4 GSTs at key positions in the sequence. One of these is the res- 
idue Asp 151, which is at the beginning portion of the a-helix 
6. Its side-chain atoms are observed to interact with the main- 
chain N atoms of Leu 142 and Thr 148, which are in the loop 
region after the long a-helix 5. It would seem that the role of 
Asp 151 is to stabilize the loop region between 2 a-helices. In- 
deed, a site-directed mutagenesis study showed that the aspar- 
tate at this position in ir GST is important for thermostability 
of the protein (Kong et al., 1993). Another amino acid with a 
similar configuration is Glu 87; it is at the beginning portion of 
a-helix 4, and its side chain interacts with the N atom of Gly 83, 
which is in the 78-84 loop connecting the 2 domains of GST. 
Furthermore, aspartate residues at the 0-turn in domain 1 are 
observed to make interesting interactions with other residues. 
The side chains of Asp 59 and Lys 77 interact with each other, 
and Asp 61 is close to Lys 86 of subunit B. 

The C-terminal region of Sj GST (residues 213-217) essentially 
sits on the surface of domain 2. It is also close to the 33-35 loop, 
a shortened version of the 33-42 loop of \i GST, that links the 
0-strand 2 and a-helix 2 in domain 1. The amino acid residues 
of these 2 regions interact closely with each other. In Sj GST, 
the side chain of Glu 33 forms a hydrogen bond with the ring 
atom Ne 1 of Trp 205; in \x. GST, the analogous interaction is be- 
tween Asp 36 and Lys 210, though in this case Asp 36 is 2 resi- 
dues farther down both in sequence and spatial relationship than 
Glu 33 of Sj GST in the interaction with domain 2. The next res- 
idue in Sj GST is Arg 34, and its side chain interacts with the 
side chain of Asp 213 and the carbonyl O atom of Gly 212. The 
corresponding residue in \l GST is Met 34, and rather than its 
side chain pointing outward as that of Arg 34 of Sj GST, it faces 
into the protein core. The 33-42 loop of <x GST interacts with 
its own symmetry-related loop and residues of domain 2. In n 
GST, although the corresponding loops have similar conforma- 
tions, they are farther apart than in Sj GST. This allows the 



symmetry-related GST to move into this gap. In a GST, the 
C-terminal a-helix 9 points toward the glutathione binding site 
and runs nearly parallel with the a-helix 2. There are hydropho- 
bic interactions between He 218, Phe 219, Val 54, Leu 40, and 
lie 34. The hydrophilic interaction between Arg 220 and Asp 41 
is also observed. Unlike fi GST and tt GST, a GST does not have 
a close symmetry-related interaction at this junction; it would 
seem that the a-helix 9 is sufficient to close the gap between the 
2 domains of GST. Thus, either the close interactions between 
the 2 domains or crystal packing at this region encloses the glu- 
tathione binding site. 

Glutathione binding site 

In all 4 GSTs, an extended form of glutathione is bound to do- 
main 1 . Generally, there are 4 types of interaction with gluta- 
thione in the binding site (Figs. 5B, 10): (1) stabilization and 
orientation of the 7-gIutamate of glutathione, (2) alignment of 
the glutathione peptide backbone, (3) stabilization of the termi- 
nal carboxylate of glycine, and (4) interaction with the sulfhydryl 
of cysteine for enzymatic activation. The binding site residues 
that are common in all 4 GSTs are Tyr 6, Asn 53, Leu 54, 
Gin 66, and Ser 67 in subunit A, and Asp 100 of subunit B 
(Fig. 10A; Kinemages 1 and 2). 

Association and orientation of the 7-glutamate is accom- 
plished by either a Ser-Gln or Thr-Gln sequence that form hy- 
drogen bonds with the carboxyl group (atoms Oal and Oa2). 
With the exception of ir GST, this configuration is further sta- 
bilized by a salt bridge between the atom N of 7-glutamate and 
Asp 100 of subunit B. In ir GST, the 7-glutamyl carboxylate 
group is rotated approximately 180° primarily by a more favor- 
able salt bridge with the side chain of Arg 13. The arginine res- 
idue at this sequence location is also present in a GST. In 
contrast, leucine is the residue in Sj GST and \l GST (Fig. 3). 

Hydrogen bonds to the glutathione peptide backbone are pro- 
duced by another important dipeptide sequence, Leu-Asn in Sj 
GST and p GST, Val-Gln in a GST, and Leu-Gin in it GST, and 
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Fig. 9. Stereo views of Cot atoms of Sj GST (in thick lines) wil 
A: fi GST. B: a GST. C: tt GST. 



influenced by the presence of Trp or Phe residues. In Sj GST 
and fi GST, the carbonyl atom Ow of glutathione cysteine forms 
a hydrogen bond with the indole ring atom Ne 1 of Trp 7, and 
the backbone N atom of glutathione cysteine interacts with the 
Leu-Asn sequence. In contrast, the presence of Phe in a GST 
and v GST causes the peptide flip of glutathione cysteine, so that 
the carbonyl atom Oo) forms a hydrogen bond with the back- 
bone atom N of Val 54 in a GST and Leu 50 in t GST. Further- 
more, in comparison to the interaction at the 7-glutamate of 
glutathione, the amino acid residues that interact with the ter- 
minal carboxylate atoms of glutathione glycine are dissimilar in 
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other GSTs (in thin lines) after the superposition of domain I . 



their composition and orientation in the 4 GSTs. The Gin resi- 
due interacts with these carboxylate atoms in a GST and tt GST 
in addition to the Arg, Lys, and Trp residues. 

The orientation and alignment of glutathione thus enables the 
close association of the ring OH atom of residue Tyr 6 with the 
S7 atom of glutathione in all 4 GSTs. A notable conformational 
difference of this residue between Sj GST and n GST after the 
superposition of the binding site is the angle of ring planes. Rel- 
ative to Sj GST, this angle is —34° (+ is counterclockwise, look- 
ing away from the backbone atoms); the angle is -15° for a 
GST and - 10° for tt GST. It has been suggested that in n GST, 
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Table 2. Superposition of domain 2 after the alignment of domain I for the 4 GST structures* : 
RMS deviation, rotation angle, and magnitude of translation 







Rotating GST 




Fixed GST 




a 


n 


Sj 


1.66 A, 5.5°, 1.10 A 


1.43 A, 6.8°, 3.45 A 


1.42 A, 5.1°, 1.27 A 






1.91 A, 7.2°, 4.52 A 


1.25 A, 4.9°, 1.32 A 


a 






1.64 A, 8.5°, 3.98 A 



n With the amino acid sequences aligned, the Ca atoms of domain 1 (residues 2-6, 14-22, 27-32, 56-59, 62-65, and 67-77 of Sj GST) were all 
superimposed, i.e., the monomer structures (subunit A) of j*» or, and x GSTs were rotated and translated onto the Sj GST structure with respect 
to these aligned residues. Then theCa atoms of a-helices of domain 2 (residues 85-110, 113-125, 129-136, 149-164, 173-184, 186-192 of Sj GST) 
were superimposed between different pair combinations of GSTs. 



the side chain of Thr 13 that sits on the Tyr 6 ring enhances the 
hydrogen bonding effect and the pK a of the glutathione cys- 
teine (Liu et al., 1993). Valine is the corresponding residue in 
Sj GST, methionine in a GST, and cysteine in ir GST. 

Site-directed mutagenesis studies have revealed the importance 
of specific residues in the glutathione binding site. Manoharan 
and coworkers (1992) showed that the substitutions of binding 
site residues Arg 13, Gin 62, and Asp 96 in ir GST resulted in 
20-50-fold decreases of both the catalysis and glutathione bind- 
ing in comparison to the wild type. However, the substitution 
of Tyr 7 to phenylalanine still resulted in 27% of the wild- type 
capacity to bind glutathione, but the enzymatic catalysis was re- 
duced to less than 1 %. The substantial reduction in the catalytic 
activity due to the substitution of tyrosine was also demonstrated 
in GST (Liu et al., 1992). Furthermore, there are differing 
views on how the thiol anion of glutathione is created to pro- 
mote high nucleophilic reactivity, i.e., either the thiol of gluta- 
thione is deprotonated (Liu et al., 1992) or tyrosine OH group 
is deprotonated (Karshikoff et al., 1993). Moreover, the impor- 
tance of binding of carboxylate anions of glutathione was also 
shown by the mutation substitution at Trp 38 in ir GST, which 
decreased the glutathione binding activity (Nishihira et al. 1992). 
Nishihira and coworkers (1993) also showed that the hydropho- 
bic ligand binding that may occur away from the glutathione 
binding site caused the quenching of Trp 38 fluorescence, which 
suggested a conformational change of the enzyme. 

Although Sj GST is from an invertebrate species and f± GST 
from a mammal, the 2 proteins show remarkable similarities in 
the amino acid sequence and structure. The shared character- 
istics in the 3 -dimensional structures of Sj GST and GST ex- 
tend to the similarity in biochemical properties. Walker and 
coworkers (1993) compared the characteristics of Sj GST (26 
kDa) and Schistosoma mansoni (Sm) GST (28 kDa); whereas 
the latter showed catalytic properties similar to the mammalian 
ft class GSTs, the former gave hybridized enzymatic properties 
of /x and a GSTs, 

In the GST binding site, glutathione binds to the domain 1 
side, and there is more space on the domain 2 side for additional 
binding by an electrophilic substrate. This region may be called 
hydrophobic (H-) subsite (Reinemer et al., 1991; Garcia-Saez 
et al., 1994) because the residues in this region are generally hy- 
drophobic, but notable differences are evident. For example, 
in n GST the residues Val 9, Leu 12, He 1 1 1 , Tyr 1 15, Phe 208, 
and Ser 209 were shown to be important for the binding of 



l-(S-glutathionyl)-2,4,6-trinitrocyclohexadienate (GTD) (Ji et al., 
1993) and the diastereomers of 9-(S-glutathionyl)-10-hydroxy- 
9,10-dihydrophenanthrene (Ji et al., 1994). The corresponding 
residues in Sj GST are He 9, Leu 12, Ser 106, Tyr 1 10, Gin 203, 
and Gly 204 (Kinemage 1). Here the key difference is the change 
of Phe 208 in ^ GST to Gin 203 in Sj GST, for the latter is ob- 
served to interact with Arg 102. The side chain of Arg 102 point 
toward the binding site and glutathione. The side chain of cor- 
responding residue Arg 107 in m GST points away from the bind- 
ing site and interacts with Asp 161 that is further away. The 
combination of Arg 102 and Gin 203 in Sj GST could make the 
binding of such a molecule as GTD even more specific in Sj 
GST. 

In the a GST structure, the ring portion of the substrate 
S-benzyl-glutathione is surrounded by hydrophobic residues 
(Val 1 1 , Leu 106, Leu 107, Ala 215, Phe 219, and Phe 221). The 
binding site of a GST is less open than that of the other GSTs 
due to its C-terminal a-helix. In 7r GST, Phe 8, Pro 9, Val 10, 
Ala 102, and Tyr 106 may be the possible residues for xenobi- 
otic binding interaction. The recent structure of mouse 7r GST 
(Garcia-Saez et al., 1994) complexed with S-nitrobenzyl- 
glutathione showed the specificity of these and other residues. 
Furthermore, Tyr 1 15 in n GST is shown to be an important res- 
idue for catalytic activity (Johnson et al., 1993). The correspond- 
ing residue in Sj GST is Tyr 1 10, Tyr 105 in tt GST, and Val 1 10 
in a GST. The influence of this residue on catalysis in GSTs 
other than \i GST has yet to be determined. 



Fusion peptide and antibody reactivity to Sj GST 

The fusion peptide of gp41 forms a series of tight turns (Fig. 8; 
Kinemage 1). Muster and coworkers (1993) showed that this pep- 
tide fused to Sj GST reacted with the Mab 2F5, which strongly 
suggests a recognizable conformation in solution. Compared to 
a free peptide, the fusion peptide may be stabilized by favorable 
interactions with the surface of the GST protein. Computer 
modeling predicts that the sequence Glu-Leu-Asp-Lys-Trp-Ala 
is a part of an a-helix (Gallaher et al., 1989). Because only this 
short sequence was utilized in the fusion with Sj GST, the pre- 
dicted structure may not be expected in the present crystal lattice. 
Moreover, a recent structure of gpl20 peptide in an antibody- 
antigen complex also adopts a conformation of tight turns 
(Ghiara et al., 1994). Once determined, the larger context of 
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Fig. 10. Glutathione substrate (in thick lines) in the GST binding site. A: Glutathone in Sj GST. B: Glutathione in p GST. 
C: S-benzyl-glutathione in a GST. D: Glutathione sulfonate in r GST. 
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HIV-1 protein structures should help to clarify the conforma- 
tional significance of these peptides. 

The Sj GST has been identified as a specific target for the host 
immune response in resistant mice (Smith et al., 1986). Although 
the anti-Sy GST antibody reactivity in infected humans has not 
been conclusive (Davern et al., 1990), more work can be done 
to determine the neutralizing characteristics of the antibodies 
against Sj GST and other components in schistosomiasis. For 
example, one of the monoclonal antibodies employed (250-7) by 
Davern and coworkers (1990) has an epitope specific for the 
C-terminal residues of Sj GST (sequence GGGDHPPK). In con- 
trast, the binding by this antibody was weaker for the 26-kDa 
Sm GST with the C-terminal residues GGGDAPPK. The change 
of amino acid from histidine in Sj GST to alanine in Sm GST 
presumably has a significant effect on the epitope recognized by 
the antibody. The location of the C-terminal residues of the Sj 
GST structure presented here is at the surface of the protein; 
thus, these residues are accessible as epitopes for antibody rec- 
ognition. The binding specificity conferred by His 214 (and per- 
haps also by Asp 213) may indicate that the antibody-antigen 
interaction could be electrostatic in nature. Because the inhibi- 
tion of these and other related GSTs would offer the possibil- 
ity of combining chemotherapy and immunotherapy (Mitchell, 
1989; Trottein et al., 1992; Capron & Capron, 1994), additional 
3-dimensionaI structural studies of the proteins from parasitic 
helminths would further enhance the development of a vaccine 
against the transmission of schistosomiasis. 

Materials and methods 

Expression and purification of Sj GST 

The plasmid pGEX-2T was used to express the Sj GST fused 
with the gp41 peptide (Smith et al., 1986, 1987; Smith & John- 
son, 1988; Muster et al., 1993; Pharmacia LKB Biotechnology, 
Uppsala, Sweden) in the Escherichia coli strain MCI 061 1 . In this 
vector, the gene coding for GST is under transcriptional con- 
trol of the tac promoter. lsopropyI-/3-D-thiogalactoside (IPTG) 
was added at a concentration of 1 mM for induction of the tac 
promoter. The bacterial cells were lysed by sonication and the 
lysate was cleared by centrifugation. The cleared lysate was ap- 
plied to a 2-mL glutathione-Sepharose B column (Pharmacia 
LKB) that was equilibrated with a buffer of 20 mM sodium 
phosphate, 150 mM NaCl, and 1% Triton X-100, pH 7.3. 
Bound Sj GST fusion protein was subsequently eluted with 
20 mM glutathione (reduced), 100 mM Tris, 120 mM NaCl, 
pH 8. The elution fractions were analyzed by SDS-PAGE. Pos- 
itive fractions were pooled. Typical yield was 10-15 mg of pu- 
rified Sj GST fusion protein per liter of bacterial culture. 

Crystallization and data collection 

Crystallization of Sj GST fused with the gp41 peptide was per- 
formed with hanging-drop vapor diffusion method. A 5-jxL vol- 
ume of protein solution (10 mg/mL) was combined with 5 
of precipitating stock solution of 60% (w/v) PEG 3350 (poly- 
ethylene glycol) and 50 mM potassium phosphate, pH 6.5. This 
droplet was equilibrated to a 1-mL reservoir of the same PEG 
solution at 22 °C. Long, needle- like crystals appeared within 2 
weeks. 



The crystals have Laue symmetry of 4/mmm and belong to 
the space group P4 3 2,2 with cell dimensions a = b = 94.7 A, 
and c = 58. 1 A. There is 1 GST monomer per asymmetric unit. 
X-ray diffraction data were collected with Rigaku RAXIS-11 Im- 
aging Plate Diffractometer mounted on Rigaku RU200 X-ray 
Generator. The crystal to detector distance was set at 133.4 mm 
and generator settings were 40 kV and 70 mA. The data set con- 
tained 8,485 unique reflections with /> \a(I) to 2.50 A (86.9% 
complete), and 10,057 reflections with /> la(/) to 2.25-A 
(75.9% complete). Reflections in the 2.50-2.25 A range were re- 
corded at the corners of the imaging plates and consequently 
only 45.1% complete. The R mcTgc value was 1 1.7% for all ob- 
servations with /> la(/). 

Molecular replacement 

The structure of Sj GST was determined by molecular replace- 
ment (Rossmann, 1990) using the unmodified coordinates of the 
subunit A of the previously determined GST (Ji et al., 1992) 
as the starting model. The program package MERLOT (Fitz- 
gerald, 1988) was used in these calculations. The rotation func- 
tion gave 2 different solutions at different resolution limits. Both 
of these 2 orientations were used to calculate translation func- 
tions, but only one gave consistent peaks at various Harker sec- 
tions. At the same time, the space group ambiguity was resolved. 
This solution was also tested by program RTMAP (written by 
J.X. Ho, unpubl.) that maps crystallographic R factors and cor- 
relation coefficients over all possible translation vectors for a 
given range of orientation angles. The translation vector that 
corresponded to the best R factor (47.9%) and highest correla- 
tion coefficient (30.2%) was consistent with the results from Pat- 
terson translation function for 2,248 reflections with F Q > 3o{F Q ) 
in the resolution range of 20.0-4.0 A. This model was then used 
in a rigid body refinement after which the R factor was 30.4% 
and correlation coefficient 48.6% for the same group of data. 
The crystal packing of the molecules produces a perfect dimer 
of GST, i.e., 2 monomers relate to each other by a crystallo- 
graphic 2-fold rotation symmetry. 

Structure refinement 

After the rotation and translation search, the secondary struc- 
ture elements of the GST model were adjusted in the rigid body 
refinement performed with XPLOR program (Briinger, 1991). 
Then initial side-chain changes were made after alignments of 
the amino acid sequences of Sj GST and \i GST (Fig. 3). Direct 
changes were made for side chains similar in size and polarity 
such as He and Leu, and Met and Lys; other residues with pair- 
ings that did not share such characteristics were converted into 
alanines to produce the initial Sj GST model. Next, a simulated 
annealing calculation was performed with XPLOR using re- 
flections in the 8. 0-3.0- A resolution range. Omit maps were 
calculated for appropriate replacements of side chains and ad- 
justments of their geometries. Molecular graphics programs SI- 
DUS (Lim, 1994) and O (Jones et al., 1991) were used to make 
these changes. Later, the resolution range was altered to include 
the 6-2. 3- A data. As the refinement progressed, water oxygen 
atoms were assigned for electron density peaks greater than 3 a 
on the F Q — F c contour maps. The present Sj GST structure 
should be defined as one determined to 2.5 A resolution; al- 
though the diffraction data extended beyond 2.5 A resolution, 



Schistosoma japonicum glutathione S-transf erase fusion protein 



2243 



less than 50<7o of the data were present in the higher resolution 
range. The final refinement of the Sj GST structure was per- 
formed with the restrained least-squares refinement program 
GPRLSA (Hendrickson & Konnert, 1980; Furey et al., 1982; 
Hendrickson, 1985). To determine the structure of fused gp41 
peptide, the main structure of Sj GST was held fixed and the 
C-terminal residues were allowed to move in the simulated an- 
nealing calculations. 

Determination of the main Sj GST structure 

The Sj GST and p GST proteins share 42°/o amino acid pair iden- 
tities and many other homologous pairs (Fig. 3). One notable 
difference observed from the amino acid alignment is that Sj 
GST does not have a long loop that is present in n GST (resi- 
dues 33-42) in domain 1. Initial rigid body refinements per- 
formed without this loop resulted in the translation of as much 
as I A for some of the a-helices, indicating that the tertiary 
structures of the 2 domains of Sj GST were different from those 
of it GST. Furthermore, the 78-84 loop connecting the a-helices 
3 and 4 was modified a few times before it was stabilized and 
showed a different spatial placement with respect to the 2 neigh- 
boring a-helices than the corresponding 83-89 loop in p GST. 
Subtle structural differences were also noted, as when residue 
154 was not changed from isoleucine of n GST into alanine by 
oversight during the first several rounds of refinement. Omit 
map for this residue position did not fit well for isoleucine. 

During the initial stages of refinement, glutathione was not 
included in the model. However, F a - F c electron density map 
clearly indicated its presence in the binding site (Fig. 5A). Glu- 
tathione was added, along with additional water molecules 
where appropriate. The presence of oxidized glutathione (2 glu- 
tathiones joined by a disulfide bond) was ruled out. As higher 
resolution data were included in the refinement, i.e., reflections 
in the 6.0-2. 3-A range, additional water molecules were grad- 
ually located. 

Difficulty in the Sj GST structure refinement was encountered 
with the positioning of the C-terminal residues. The sequence 
alignment indicated that the 3 glycine residues 210-212 may be 
paired with the last 3 residues of m GST (215-217), but this left 
the additional 5 residues of Sj GST (His 213, Asp 214, Pro 215, 
Pro 216, and Lys 217) and another 15 residues of the fusion pep- 
tide without a starting model. This question was resolved by the 
refinement performed (using reflections in the 8. 0-3.0- A range) 
without the residues beyond the a-helix 8. The C-terminus of Sj 
GST was gradually rebuilt and showed the same loop folding 
pattern as that for the n GST. The presence of electron densi- 
ties for 3 glycines (210-212) supported for this folding pathway. 
The F Q - F c density in Sj GST could be traced to residue Pro 215 
without addition of water molecules. A similar difficulty was 
also encountered for the structure of \i GST (Ji et al., 1992), for 
which the interpretation of the last 17 residues was resolved 
when 150 water molecules were added. 

Determination of the fusion peptide structure 

After the Sj GST structure was determined to residue Pro 215, 
the refinement was extended to higher resolution reflections in 
the 6.0-2. 3-A range. With the addition of approximately 50 wa- 
ter oxygen atoms, the location of the fusion peptide structure 
became more evident. Its electron density was located in the 



pocket formed by the a-helices 6, 7, and 8, and near Pro 215, 
but it was not connected to Pro 215. Examination of the crys- 
tal symmetry indicated 2 possibilities: (1) the fusion peptide ex- 
tends from Pro 215 directly to this density so that it lies on the 
surface of the main Sj GST structure, or (2) it extends to the den- 
sity in the neighboring region so that the last several residues of 
the fusion peptide would interact with symmetry-related GSTs 
other than the original Sj GST in the main cell (Fig. 7). The sec- 
ond possibility was a more appropriate choice, because when an 
alanine model was placed in the fusion peptide region, the ex- 
tension of electron density was evident from the neighboring re- 
gion to Pro 215. In contrast, the direct connection of the electron 
density on the surface of Sj GST to Pro 215 could not be made 
even after many rounds of model building and refinement. Fol- 
lowing this determination, residues were added at the C-terminus 
to Arg 223. At this point in the sequence, the fusion peptide was 
now interacting with the symmetry-related GSTs. The F 0 - F c 
map indicated that the last 9 or 10 residues of the fusion pep- 
tide formed a loop structure. The residue Gly 224 initiates a turn, 
and at Met 226 another sharp turn is made. Finally, there is an- 
other turn at Asp 229 and Lys 230 (Fig. 8). 
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